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Abstract 

In this study, we tested the outcomes of access to a Response to Intervention (Rtl) 
model in kindergarten or in first grade on end-of-Grade-2 reading achievement and 
placement in special education. Across five schools, 214 students who began having 
access to Tier 2 intervention in kindergarten or first grade were compared in Grades 1 
and 2 with 208 cohort peers who were average readers, and 102 historical control 
condition second grade poor readers who did not receive Tier 2 intervention. Results 
found significant effects on reading achievement for access to Rtl in kindergarten at the 
end of first grade (effects averaged 0.48), but not in second grade, except for students 
who were English Language Learners (ELL), who showed an advantage through the end 
of second grade. Students with access to Rtl overall had significantly higher outcomes at 
the end of Grade 2 than students in the historical control, with no differences due to ELL 
status. No significant difference in the proportion of students placed in special education 
was noted; however, a greater proportion of the students found eligible as LD had poor 
reading scores if they were placed after participating in Rtl. 
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Responsiveness to Intervention Outcomes: 

Does Beginning Intervention in Kindergarten Matter? 

The Individuals with Disabilities Education Improvement Act (IDEIA, 2004) 
encourages schools to adopt and implement multi-level prevention systems, known as 
response-to-intervention (Rtl) models. The motivation for RTI stems in part from 
dissatisfaction with the traditional method for identifying students with learning 
disabilities (LD) in reading, which involves a discrepancy between a student’s cognitive 
ability and achievement. Because the kindergarten and first grade expectations for 
reading skills are both limited (albeit no less important) and constantly increasing, this 
discrepancy LD identification method often misses students; students are not found 
eligible for intensive intervention through special education services until 2 nd grade or 
later. By delaying identification of LD, students lack opportunity for early intervention 
in reading, which often has positive effects on short- and long-term reading achievement 
(O’Connor, Fulmer, Harty, & Bell, 2005; Simmons, Coyne, Kwok, McDonagh, Harn, & 
Kame’enui, 2008; Vaughn, Linan-Thompson, & Hickman, 2003). 

A second motivation for Rtl is the prevalent belief that some students are 
identified for LD due to instructional failure, meaning that students demonstrate poor 
reading achievement due to instruction that was insufficiently rigorous at the general 
education level (in Rtl known as Tier 1). These students would not need special 
education if their reading instruction had been more careful, research-based (Luchs, 2003; 
O’Connor, 2000), or differentiated (Connor, Morrison, & Underwood, 2007). By 
providing small-group, explicit, and intensive instruction (also called Tier 2) and 
monitoring whether reading improves, Rtl models may help to differentiate students who 
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have need for a special education from those who would grow adequately with better 
instruction. 

Due to the change in the IDEA and the success of the early intervention studies of 
the last 15 years (e.g., Blachman, Tangel, Ball, Black, & McGraw, 1999; O’Connor et al, 
2005; Torgesen et al., 1999; Vaughn et al., 2003; Vellutino et al., 2006, 2008), many 
school districts have adopted implementation of Rtl as a means for preventing reading 
difficulties and assisting with identification of LD. These school-based models of Rtl 
differ from the experimental studies in several respects. First, experimental studies 
usually select a group of students based on risk criteria identified at one time point (e.g., 
beginning or middle of kindergarten, beginning of first or second grade). These students 
become the risk pool for intervention and control groups. Although useful for 
experimental control, this experimental design model ignores the fact that students 
exhibit risk at different time points and on different dimensions of reading. For example, 
many students have reading difficulties based on poor letter knowledge or phonemic 
awareness in kindergarten and would be included in a kindergarten risk sample; however, 
other students perform adequately on these kindergarten skills and still demonstrate 
difficulties decoding words in first grade. These students, whose reading difficulties stem 
from integration of multiple skills, would not be included in the risk pool identified with 
kindergarten measures; however, they are included in the more fluid, school-developed 
models of Rtl (Marston, Myskens, Fau, & Canter, 2003; Mellard, McKnight, & Woods, 
2009; Tilly, 2003). 

Schools that implement Rtl as policy include all at-risk students by screening 
multiple times each year and including students in intensive interventions (Tier 2) as their 
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scores fall below cut-points that indicate poor reading growth or achievement relative to 
their peers. In these practical applications of Rtl, student movement is fluid across Tiers 
of intervention over time. Thus, rather than identify a pool of students at risk and provide 
intervention for a specific length of time (e.g., 10 weeks, a semester, or a school year) 
regardless of growth, the school-based models admit and release students from Tier 2 
interventions flexibly based on their performance (e.g., see Marston et al., 2003 for a 
description of the Minneapolis implementation model, Mellard et al., 2009 for screening 
and selection procedures, and the Division for Learning Disabilities (2008) guide for 
teachers). 

This practical fluidity across Tiers changes the fundamental research questions 
from those used experimentally (e.g.. What are the outcomes of X treatment applied for 
10 or 20 weeks to students at risk? What proportion of children at risk is no longer at risk 
after tutoring? What proportion is identified for special education?) to those with 
practical importance to school and district personnel (In what grade should we start an 
Rtl system? What effect will this model have on students who are English Language 
learners? Does Rtl improve long-term outcomes for students? Does it improve 
identification for special education?) 

Beginning Tier 2 in Kindergarten or First Grade 

Research syntheses (e.g., the National Reading Panel, 2000; Snow, Bums, & 
Griffin, 1998) have identified underlying reading skills that can be mastered by most 
students in kindergarten or first grade, including letter-sound correspondences and 
phonemic awareness, which together contribute to a child’s understanding of how reading 
and writing function to capture spoken words, also known as the alphabetic principle. 
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Beginning intervention in these key areas before students experience substantial failure in 
their effort to learn to read may impact long-term outcomes, such that the severity and 
perhaps the incidence of LD in reading may be decreased (Coyne, Kame’enui, Simmons, 
& Harn, 2004; O’Connor et al., 2005; Wanzek & Vaughn, 2011). 

Evidence for this possibility began to accumulate in the 1990s, as researchers 
explored whether teaching kindergarten children letter-sound correspondences, phonemic 
blending and segmenting, and linking these areas together might improve their learning 
of words (O’Connor, Jenkins, & Slocum, 1995; Tangel & Blachman, 1992; Torgesen et 
al., 1997). These studies followed an Rtl model, in that kindergartners were screened and 
those most at risk were selected for small-group, intensive instruction (Tier 2). Outcomes 
of these studies were positive; however, the researchers also pointed out that selecting 
children for risk status in kindergarten tended to “overselect;” that is, students’ lack of 
experience can be confused with poor learning potential. Thus, some of the success of 
these early interventions could be attributed to selecting students who would have learned 
these skills without specific intervention. Nevertheless, results of the kindergarten 
experiments have been so consistent that it is now common in schools to screen students 
for difficulties in these early reading skills, even when schools are not specifically 
implementing an Rtl model of prevention. 

Using a similar focus on the alphabetic principle, other researchers have explored 
the benefits of first grade intervention (Cunningham, 1990; Denton et al., 2010; 
O’Connor, 2000; Vellutino, Scanlon, Zhang, & Schatschneider, 2008), either alone or in 
combination with kindergarten intervention. These studies have shown positive effects 
for a first grade start to Tier 2, as well as declining proportions of poor responders when 
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first grade intervention was provided to students who responded poorly to intervention in 
kindergarten, or whose end-of-kindergarten scores still indicated risk for poor reading 
outcomes. All three models (intervening in kindergarten, intervening in first grade, or 
providing intervention as needed through kindergarten and first grade) have shown 
positive effects on reading development; however, they have rarely been compared 
within a single study. 

Although the belief that earlier intervention is better than later intervention is 
common, not all studies of intensive intervention have found that earlier intervention 
improves rate of learning over later intervention, and many studies indicate that the 
accuracy of identification improves after kindergarten (Compton et al., 2010; O’Connor 
& Jenkins, 1999). Moreover, confining identification of at-risk readers to kindergarten is 
problematic; measures of early literacy used in kindergarten (e.g., letter naming and 
segmenting) are less sensitive for identifying later-emerging reading difficulties in 1st or 
2 nd grade than are measures that require reading and comprehending text (Catts, 
Compton, Tomblin, & Bridges, 2012). When Cunningham (1990) compared the reading 
achievement of students who received phonemic awareness intervention in kindergarten 
vs. first grade, first graders showed stronger outcomes in transfer measures of reading 
achievement (although experimental group kindergartners outperformed control group 
first graders on phonemic awareness tasks). Other studies that have included students in 
intervention across a range of grade levels in elementary school have found similar 
growth in reading between older and younger students. Lovett and Steinbach (1997), 
Rashotte, MacPhee, and Torgesen (2001), and O’Connor, Bell, Harty, Larkin, Sackor, 
and Zigmond (2002) developed and delivered reading intervention to students at three or 
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more grade levels and found no grade-level effect on reading improvement. Conversely, 
Dion et al. (2010) compared effects of enhanced Tier 1 instruction that began in 
kindergarten or first grade, and found that students benefited more from the earlier start 
to evidence-based instruction, although the effect was only for the lowest-skilled 
students. Tier 2 intervention was not part of that study’s design. 

Identifying students as struggling learners in kindergarten has a well-known 
downside mentioned earlier: measures tend to overselect the proportion of students who 
may be at risk and result in funneling costly resources unnecessarily to students who are 
not truly at risk for severe reading disabilities (Compton et al., 2010; Jenkins & 
O’Connor, 2002; Speece, 2005). In addition, studies of intervention lasting two or more 
years often report loss of skills over the summer months (O’Connor et al., 2005; 

Vellutino et al., 2008); thus beginning in kindergarten may mean spending time again in 
first grade on skills that appeared to have been mastered the year before. For these 
reasons, beginning in first grade could be more cost-effective than in kindergarten if the 
long-term outcomes are strong. 

The issue here is not to compare one with two years of ongoing Tier 2 
intervention, which has been done in other studies, but rather to offer access to 
intervention - as needed — in kindergarten or first grade. In Rtl models, students are 
assessed on a set schedule and as their scores indicate risk they are included in Tier 2 
interventions. Some students may develop risk for reading difficulties in 1 st grade and 
still score adequately (e.g., at or above the 25 th percentile) on kindergarten measures. For 
these students, having access to Rtl in kindergarten would not matter because they would 
not have been selected for Tier 2 intervention at that time. Other students with reading 
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difficulties in 1 st grade might have alphabetic or phonemic difficulties that would have 
been apparent at the beginning of kindergarten. For these students, access to Rtl in 
kindergarten might either eliminate their reading difficulty altogether, or reduce the 
difficulties they experience in 1 st grade. Therefore, in this study, students were provided 
with Tier 2 intervention as their scores on screening measures indicated risk (i.e., in fall 
or winter during any grade in which the student had access to the Rtl model). 

Likewise, students did not continue in Tier 2 for a set number of weeks. 

Screening assessments and progress monitoring measures advocated by researchers in Rtl 
are meant to capture risk at various time points. Thus, when students score above the cut- 
points that indicate risk for reading difficulties, they are meant to be returned to less 
intensive intervention, most frequently moving from Tier 2 (small group intervention) to 
Tier 1 (general education instruction only). In this study, when students scored above the 
risk cut-points for six weeks, they returned to Tier 1; however, we monitored their 
reading growth in Tier 1 and returned them to Tier 2 if they failed to thrive with the 
general education instruction alone. How this model’s fluidity affects our understanding 
of the emergence of reading disabilities, student response to instruction, and the 
proportion of reading disabled children within different populations, have received little 
research attention. 

Including English Language Learners 

In many areas of the country, the proportion of students who are English 
Language Learners (ELL) is increasing. In southern California, for example, schools 
commonly comprise 30% ELL or more, and these students often demonstrate 
disproportionate rates of reading difficulties in English (Kieffer, 2010; Lee, Grigg, & 
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Donahue, 2007). To attempt to reduce potential bias in identification of students from 
minority cultures, ELL, and males, VanDerHeyden, Witt, and Gilbertson (2007) 
implemented an integrated model of Rtl in five elementary schools using the System to 
Enhance Educational Performance (STEEP) model. The study focused on the referral 
process and special education identification when the model was used by school 
personnel. Overall, using Rtl as a process for identification decreased bias associated 
with gender and ethnicity; however, the authors did not evaluate RtFs effect on ELL 
identified for special education. 

Studies of early reading intervention in kindergarten and first grade have 
demonstrated positive effects for students who are ELL (Linan-Thompson, Vaughn, 
Prater, & Cirino, 2006; Quiroga, Lemos-Britton, Mostafapour, Abbott, & Beminger, 
2002). Nevertheless, educators find it difficult to determine whether young students who 
are ELL need intensive reading intervention, or need experience in the school system for 
their language in English to develop (Klingner, Artiles, & Barletta, 2006). Part of this 
difficulty is that word reading, language comprehension, and reading comprehension may 
develop unevenly for ELL (Jean & Geva, 2009). In a longitudinal study of ELL from age 
4.5 to 11, Mancilla-Martinez and Lesaux (2010) reported average word reading skills 
alongside very low language skills in kindergarten and first grade. By 5 th grade these 
children comprehended text at only the 2 nd grade level, despite adequate word decoding. 
This disparity was also found between word reading and meanings of words (Mancilla- 
Martinez & Lesaux (2011). Relative skill in decoding coupled with poor reading 
comprehension may also account for Crosson and Lesaux’s (2010) finding that reading 
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rate shows weaker relations with reading comprehension for ELL than for native English 
speakers by 5 th grade. 

Across these studies, researchers have recommended that oral language 
development in English be incorporated with skill interventions throughout the primary 
grades. This recommendation may be especially important in states such as California, 
where most instruction in public schools occurs in English. We reasoned that in an Rtl 
model that includes students who are ELL, conversation and use of text-centered 
language should be included in Tier 2 intervention. Because these interventions are 
conducted most often in small groups, students who are ELL may benefit from 
opportunities to learn and use the English language, in addition to assistance in learning 
to read in English (O’Connor, Bocian, Beebe-Frankenberger, & Linklater, 2010). 

Whether students who are ELL would make greater gains with earlier access to 
Rtl has not been explored. It is possible that students who are ELL could profit more 
than native English speakers from small group intervention that enables more opportunity 
to use the English language than whole class instruction offers typically. In particular 
students who are ELL may benefit from an earlier start to reading intervention and 
continued support as needed in first and second grades. 

Rtl and Special Education 

The key issues of whether Rtl affects the incidence of disability or reliability of 
identification in the primary years create additional analytic difficulties. Few studies 
have examined long-term implementation of an Rtl model (i.e., three or more years) and 
its effect on the identification of disability. O’Connor et al. (2005) demonstrated a 
downward trend in incidence of LD; however, the number of students in that study was 
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too small for statistical analysis. In a larger study that followed students through third 
grade (Wanzek & Vaughn, 2011), the proportion of students found eligible for special 
education following implementation of Rtl again showed a downward trend, but did not 
differ statistically. Similarly, VanDerHeyden et al. (2007) reported that although more 
students received attention from the school-based team through implementation of their 
model of Rtl, fewer students were referred for, and qualified for special education.. 

Long-term outcomes (end of second grade) of Rtl on special education are also 
difficult to judge due to attrition. For example, in Vellutino et al.’s (2008) study, only 
62% of the students who had received Rtl were available for testing three years after it 
began, and only 25% of the kindergarten students remained through Grade 3. Given the 
attrition reported in previous longitudinal studies (e.g., Coyne et al., 2004; O’Connor et 
al., 2010; Vellutino et al., 2008), outcomes can be difficult to interpret. 

Although models of Rtl have been researched for many years, more must be 
learned about the outcomes of various choices in Rtl frameworks as they are used in 
schools. One choice faced by school districts is when to begin using Rtl to monitor and 
intervene with students who have reading difficulties. In this study, we explored the 
effects of this decision by comparing outcomes of beginning access to Tier 2 Rtl 
interventions in kindergarten or 1 st grade on end-of-Grade-2 reading achievement for 
native English speakers and for ELL, along with placement in special education. Our 
specific questions were: (1) Does access to Tier 2 intervention in kindergarten or first 
grade improve reading outcomes over untreated students at the end of Grade 2? (2) Do 
outcomes differ between students who are ELL or native English speakers? and (3) Do 
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placements in special education for students with high-incidence disabilities differ across 
kindergarten access to Rtl, first grade access, and an untreated control group? 

Method 

Study Design 

In this three-year longitudinal study, schools in Year 1 were assigned randomly 
from matched pairs to begin the Rtl model in kindergarten or first grade. Student 
outcomes at the end of Grade 2 were compared with those of students in the same schools 
who were 2 nd graders in Year 1, also called the Historical Control group. All students 
were assessed three times per year with multiple reading measures. When scores on 
these measures indicated risk at any time point, students received Tier 2 intervention 
according to their assignment (kindergarten access, first grade access, or no access for the 
Historical Control). Students continued to receive Tier 2 intervention until their reading 
scores no longer indicated risk. The details of participants and procedures are described 
below. 

Participants 

Student participants began with three cohorts—kindergarten, first, and second 
grade students—with approximately 410 students in each initial cohort. The students in 
these three grade levels attended five elementary schools in two school districts in the 
Southern California region. The schools served children whose Free and Reduced Lunch 
status ranged from 56% to 95% (California Department of Education 2008 Growth 
Academic Performance Index/Demographics) and many children were learning English 
(California Department of Education Language Census Data, 2007-08). With only a few 
exceptions, the first language for students who were English Language Learners (ELL) 
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was Spanish, and the proportion of ELL students ranged from 38.24% to 60.17% across 
schools. The English Language Arts achievement proficiency levels of the students in 
these schools, as measured by the state mandated group administered annual assessment, 
the California Standards Tests (CST), were below average (California Department of 
Education Data Quest Star Test Results 2006-07 School Reports). Across schools, the 
percent of students at the end of 2 nd grade who scored proficient on the California State 
Test (CST), English Language Arts, ranged from 23 to 45, and by 3 rd grade the range 
dropped to 17-25%. Table 1 provides by school the basic demographics of free and 
reduced lunch status, parent education level, percentage of ELL students, and percentage 
of students scoring proficient and above on the English Language Arts CST for the initial 
year of this study. 

Schools. School districts nominated potential school sites, and participation in 
the study was negotiated with the principal, the teachers, and researchers. None of the 
schools had participated previously in a model of Rtl. We collected informed consent 
letters from all teachers in kindergarten, first, and second grade at the participating 
schools to allow class-wide administered assessments and class observations. 

Of the five schools, four were matched on proportion of students who were ELL, 
received free or reduced lunches, and proficiency levels on CSTs. In each pair, one was 
randomly chosen to wait to implement the intervention portion of Rtl until Grade 1 and 
one to begin in kindergarten and grade 1; however, kindergarten and first grade students 
at risk for reading disabilities were identified and monitored at all five schools. The fifth 
school was assigned to begin access to Rtl in kindergarten based on most severe need 


(School 5). 
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Teachers. All of the kindergarten, first, and second grade teachers (n = 66) in the 
five schools participated in the study. All used the same reading curriculum (Houghton 
Mifflin Language Arts), and participated in 120 hours of language arts professional 
development on implementing this curriculum (40 hours of training followed by 80 hours 
of follow up), which was provided through the California Reading Development Center. 
This level of professional development and the reading curricula were accepted as 
evidence-based reading instruction in California’s application for Reading First funding. 
California further requires that ‘teachers of English learners (EL) hold an appropriate 
document or authorization for English language development (ELD), specially designed 
academic instruction delivered in English (SDAIE), or content instruction delivered in the 
primary language.’ (California Commission on Teacher Credentialing, Teaching English 
Learners, 2010) Districts must monitor and ensure that teachers hold appropriate EL 
Authorizations, Crosscultural Language and Academic Development (CLAD) 

Certificates, and Bilingual Authorizations (BCLAD) authorizing instruction to English 
learners. 

Students. 

K Access and Grade 1 Access. Students were classified as K Access if they: a) 
were kindergartners in Year 1 and b) attended one of the three schools where RTI 
intervention began in K. Students were classified as Grade 1 Access if they a) were first 
graders in Year 1 and b) attended one of the three schools where RTI intervention began 
immediately; or c) were kindergartners in Year 1 and d) attended one of the two schools 
where RTI intervention was delayed until 1 st grade. Thus the K Access cohort of students 
received Tier 2 treatment as kindergartners and/or first and/or second graders, depending 
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on when and for low long their scores indicated risk. The Grade 1 Access cohort of 
students received Tier 2 as needed in first and/or second grade when scores made them 
eligible. Eligibility criteria are described under Procedures and shown in Table 3. 

Of the 819 K-l students who began the study, 38% of students with K access and 
41% with Grade 1 access to Rtl had left the study by the end of 2 nd grade through attrition 
(families moving out of the participating school catchment areas), leaving 226 students 
with K Access and 386 students with Grade 1 Access to Rtl by the end of Grade 2. 
Although attrition varied by school, it did not vary across cohorts. Unlike our earlier 
studies (Author), attrition was slightly higher for average readers than for those who 
received Tier 2. 

Tier 2 participants. Among the final longitudinal sample of students who 
qualified for and received intervention at some point between kindergarten and second 
grade, 75 students were from the Kindergarten Access cohort and 139 were from the First 
Grade Access Cohort. These students represent 33% and 36%, respectively of the 
students available from these two cohorts. Among these students 74.4% were Hispanic, 

11.2 % were African American, 9.8% were White. A small number of students (2.8%) 
were either American Indian, Vietnamese, Asian Indian, or Other Pacific Islander, and 
3.7 % declined to state their ethnic status. Males were 56.5% of this at-risk sample, and 
49% of the students were English Language Learners (ELL). All but 8.5% of these ELL 
students were at the early intermediate level or below, as indicated by their scores on the 
California English Language Development Test (CELDT). Table 2 shows the number of 
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students by cohort and grade who entered and exited Tier 2 intervention across the three 
years. Of those Tier 2 students, the number who were EL is shown in brackets. 

Average readers. Among the student peers in the K and Grade 1 Access cohorts 
who did not qualify for intervention (average readers), 70.2% were Hispanic, 10.6% were 
African American, 12.5% were White and 2.9 % were Vietnamese. A small number of 
students (3.4%) were either American Indian, Korean, Asian Indian and Cambodian, or 
Filipino, and 0.5% declined to state. Males were 42.8% of this average reader peer 
sample, and 53.4% of these students were ELL. The proficiency in learning English was 
higher for these ELL students, as 24.3% scored at the Intermediate or Advanced level on 
the CELDT. By the end of 2 nd grade, the attrition rate was 67% across average-reader 
cohorts. 

Grade 2 controls. Students in the five schools who were in 2 nd Grade in the first 
year of the study formed our control group. Because 33-36% of students who had access 
to Rtl across the three years received Tier 2 at some point, we identified the bottom 33% 
of students in our Grade 2 Control to designate at-risk status. Operationally, this criterion 
selected students reading fewer than 39 wcpm in fall of Grade 2. The number of 
students selected (N=102) using this method generated a risk proportion similar to the 
risk proportions in the cohorts with access to Tier 2 intervention. Demographic statistics 
for the historical comparison group of at-risk 2 nd graders were similar to those who had 
access to Rtl, with 74.5% Hispanic, 12.7% African American, and 11.8% White. These 
students were 55.4% male and 44.5% ELLs. All but two of the ELL students in the risk 


sample were at the early intermediate or lower level on their second grade CELDT test. 
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Instruction (Tier 1) in these 2 nd grade historical control classes in Year 1 was very 
similar to the Tier 1 instruction our 2nd grade Rtl participants received in Years 2 and 3, 
because Tier 1 teachers used the same reading curriculum across all years in all classes. 
Over 80% of students in RTI cohorts had the same teachers as the 2 nd grade historical 
control. 

Students in the historical control group (N= 391) received no experimenter- 
designed Tier 2 intervention. We established with school personnel that students would 
be eligible for any school-based intervention deemed appropriate, including special 
education and tutoring programs, regardless of experimental condition. However, aside 
from special education services, no additional interventions was available for 2 nd grade 
historical control students when they were in kindergarten or first grade. In 2 nd grade, 
pull-out remedial services were offered to poor readers in the Control in three of the five 
schools, which consisted of 25-minute small group (about 6 students) reading instruction 
4 days per week from the school’s reading coach using Reading Mastery II (Engelmann 
& Bruner, 1995). Students were grouped by class for this service, rather than by reading 
ability as the program authors recommend, and remained with their group throughout the 
school year. 

Measures 

Assessments for sample selection and description. The Peabody Picture 
Vocabulary Test-3rd edition (PPVT-R; Dunn & Dunn, 1997) was used to describe 
receptive language in English for all students. The PPVT is an individually administered, 
norm-referenced measure of receptive vocabulary designed for individuals 2.5 years old 
through adult. The child selects from among four pictures, one which best represents a 
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word read by the examiner. We report standard quotient scores here (raw scores 
standardized for age in years and months at the time of testing), with a mean of 100 and 
standard deviation of 15. 

For selection for intervention and monitoring the students’ responsiveness, we 
used subtests of the Dynamic Indicators of Basic Early Literacy (DIBELS, Good & 
Kaminski 2003) in all grades and Word Identification Fluency (WIF, Fuchs, Fuchs, & 
Compton, 2004) in first grade, all of which are timed and individually administered. 
Criteria for entering and exiting Tier 2 are described in Procedures. Kindergarten 
measures. DIBELS Letter Naming Fluency (LNF) is a measure of accuracy and speed 
of naming upper and lower-case letters that are randomly arranged. LNF contains 100 
upper and lower case letters that are randomly ordered on a paper. Students are asked to 
name as many letters as they can within one minute in the order presented. The examiner 
notes errors, and subtracts errors from the total number of letters correctly named within 
a one minute period. The DIBELS initial sound fluency (ISF) is a standardized 
assessment that captures emerging phonological awareness (Kaminski & Good, 1996). 
The examiner shows a page of four pictures to a child and names each picture. The 
examiner then asks the child to point to the picture beginning with the sound /m/. The 
examiner starts the stop watch as soon as the question is asked. If the child does not 
respond in 5 seconds, the watch is stopped, the item is scored 0 and the examiner moves 
on to the next set. The child’s score is the sum of all correct responses within the 
accumulated minute period. Hintze, Ryan and Stoner (2003) reported alternate form 


reliability of .86. 
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In January, Phonemic Segmentation Fluency (PSF) replaced ISF for kindergarten 
students. DIBELS PSF measures students’ ability to segment 3- and 4-phoneme words 
into their individual phonemes (e.g., sat = /s/ /a/ /t/). The examiner orally presents the 
child with a word, and the child responds with all or some of the component parts. The 
student’s score is the number of sounds attempted minus the errors within one minute. 

The PSF one month alternate form reliability is .88 in May of kindergarten, and the 
predictive validity from Spring of kindergarten to reading connected text in Spring of 
first grade is .62 (Good et al, 2004). 

First Grade. First grade students received four individually administered 
assessments: Letter Naming and PSF (described earlier), and Nonsense Word Fluency 
from the DIBELS battery of assessments, and Word Identification Fluency (Fuchs et al.,, 
2004). 

DIBELS Nonsense Word Fluency (NWF) measures the child’s emerging sense of 
the alphabetic principle and the ability to use this concept to decode nonsense CVC 
(consonant vowel consonant) and CV (consonant vowel) words. Students are presented 
with a paper with these patterns and directed that they may read them as one word (eg. dif 
is /dif/) or as individual sounds (/d/ /i/ /f/). Either method would be scored as 3 points. 
Students are given one minute to read as many of these nonsense words as possible and 
the final score is the number of sounds attempted minus errors. The one month test-retest 
reliability for the NWF in first grade is .83 (Good et al., 2004). The predictive validity of 
the NWF for reading text (Oral Reading Fluency, ORF) is .82 in the spring of first grade 
(Good et al., 2004). First grade students who identified 25 or fewer sounds in one minute 


in fall were considered at risk. 
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Word Identification Fluency consists of word lists developed by Fuchs et al. 
(2004), which contain 100 isolated words randomly selected from Dolch pre primer, 
primer, and first grade high frequency word lists. Students are presented with this list 
and asked to read the words as quickly as they can. The score is the number of words 
read correctly within one minute, a measure of automaticity of reading skill. Fuchs et al. 
(2004) report alternate test form reliability of .91 from two consecutive months. The 
alternate-test form stability coefficient for two consecutive weeks was .92 in the 
beginning of first grade. Based on findings from the development studies of this 
measure, first graders who identified 8 or fewer words in fall were considered at risk for 
reading disabilities. 

DIBELS Oral Reading Fluency measures reading rate and accuracy. Passages for 
oral reading fluency were administered three times per year to first and second graders 
beginning in winter of first grade: fall and winter to determine risk status for Tier 2 
intervention, and as an outcome in spring of each year. It was also administered every 
three weeks to students in the Tier 2 interventions to monitor progress. The DIBELS 
ORF passages are used nationally extensively to identify students who need instructional 
support and to monitor educational progress. Benchmark passages are available for each 
grade level for Fall, Winter, and Spring of each year. The student is presented with three 
different passages and asked to read aloud for a period of one minute. Scores are 
calculated as the number of words attempted minus errors, and the median score is used 
for analysis. Alternate-form reliability ranges from .79 to .94 across measures, and inter¬ 
rater reliability for the first grade sample is 0.95. Second grade students with an ORF 
score of 35 or fewer words per minute in fall were considered at risk. 
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Outcome measures. Oral reading fluency (described earlier), subtests of the 
Woodcock Reading Mastery Tests-NU (WRMT; Woodcock, 1998), and the Gray Oml 
Reading Test-4 (GORT-4; Wiederholt & Bryant, 2001) were used as outcome measures 
at the end of 2 nd grade. The WRMT Word Identification subtest required students to 
identify words in isolation, the Word Attack subtest required students to apply phonic and 
structural analysis to pronounce pseudowords, the Vocabulary subtests required students 
to identify synonyms, antonyms, and analogies to written words, and the Passage 
Comprehension subtest required students to read 1 or 2 sentences silently with a missing 
word signaled by a blank space, and to supply a word that made sense in that space. 
Split-half reliability estimates for Grade 2 on these subtests ranged from .89-.92. 

The Gray Oml Reading Test 4 was selected to generate standardized scores for 
reading fluency and comprehension of paragraph-to-page length passages. The GORT-4 
assesses students’ reading accuracy, rate, and comprehension on passages of 50 to 200 
words in length. Students read increasingly difficult passages orally, while the examiner 
notes errors and miscues. Following oral reading, the examiner asks passage-dependent 
comprehension questions that tap a range of comprehension types, from literal to 
inferential. The reliability of the GORT-4 at ages 8-10 ranges from .89-.91. 

Procedures 

Determining risk status for entering and exiting Tier 2. Students were 
determined to be at risk if they fell below grade specific benchmarks in the fall or winter 
of the academic year (see Table 3). For kindergarten and first grade students, a 
combination of early literacy measures was examined—Fluency in Initial Sounds, Letter 
Naming, Phonemic Segmentation, Nonsense Words, and Word Identification—and 
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students scoring below these benchmarks participated in Tier 2 as their assigned 
condition gained access to the Rtl model. Fluency in oral reading of connected text 
(Dynamic Indicators of Basic Early Literacy System (DIBELS) ORF) was an additional 
criterion used to determine risk status beginning in the winter of students’ first grade 
year. Our cut-points for selecting students for Tier 2 were higher than those 
recommended in the DIB ELS manual due to our earlier studies (Author, 1999; 2005; 
2010) that found the published criteria too low to identify nearly all (90%) students who 
later demonstrated poor reading achievement in 1 st or 2 nd grade. Second grade students 
were screened for risk status exclusively with the DIBELS ORF measure. Table 3 
delineates the measures and benchmarks by grade level for entering and exiting Tier 2 in 
fall and winter of each grade. 

In our largest school, 85% of the kindergarteners would be considered at risk 
based on one of the three measures (LNF, ISF, PSF). We selected for recruitment those 
students with scores that met selection criteria on all three measures. 

Informed consent letters in English and Spanish were sent home via the school 
site to obtain permission for the student to participate in the intervention. Across the five 
schools and two grade levels of recruitment for intervention, parents of only 5 students 
either refused or did not respond. Once parental consent was obtained, student informed 
assent was solicited. Students were folded into the pool of “at-risk and eligible for Tier 
2” status based on scores collected in the fall and/or winter. Thus some students with 
access to Rtl in K or Gr 1 began Tier 2 in the fall of their access year, and others 
exceeded criteria in fall, but met criteria for Tier 2 later that year or the following year as 
their scores failed to keep up with peers. 
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Determining response to Tier 2 intervention. Researchers have used several 
methods for determining response. For consistency across years, we used the criteria in 
Table 3 not only to identify students for Tier 2 intervention, but also to determine when 
to return them to Tier 1 instruction only. Tier 2 participation was fluid, with new 
students folded into Tier 2 at any time their scores fell below the Table 3 benchmarks. 
Students who received Tier 2 intervention were monitored every three weeks using 
alternate forms of the measures that were used to select them for Tier 2. When for two 
consecutive cycles students surpassed the selection criteria for the succeeding screening 
cut-points, they stopped participating in Tier 2 intervention. For example, the cut-point 
for selecting students for Tier 2 at the beginning of Grade 2 is reading fewer than 39 
words correctly per minute on the ORF. If Tier 2 Grade 1 students read more than 39 
words per minute correctly on two consecutive ORF monitoring cycles during the winter 
or spring of Grade 1, they were released from Tier 2. Returning to Tier 1 only was not a 
permanent decision for many of the students, who were returned to Tier 2 if they failed to 
make sufficient progress in Tier 1 to surpass the cut-points in later screenings. 

Risk status of all students with access to Tier 2 was reassessed in the fall and 
winter each year. In addition to meeting the grade level risk criteria defined in Table 2, 
students also had to have been present in these schools in Year 1, although they did not 
need to be among students who were identified for Tier 2 during the first year of the 
study. We did not fold in to our sample students who were new to the school sites in 
Year 2 or 3. Students who were part of the intervention in the prior year continued in the 
intervention until they met the exit criteria in Table 3. Students who had access to Tier 2 
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in kindergarten or 1st grade entered Tier 2 in fall or winter of any eligible year based on 
these screening measures. 

Tier 2 intervention. The Tier 2 intervention consisted of small group (two or 
three students) instruction for 20 minutes in kindergarten and 25-30 minutes in first and 
second grade, four times per week. The intervention was offered from September 
through April of the academic year and delivered by project staff during the regular 
school day. Students were pulled from their general education classroom according to 
classroom schedules such that Tier 2 instruction supplemented, rather than supplanted, 
English Language Arts instruction or English Language Learner programming. 

Kindergarten. Tier 2 activities were drawn from Ladders to Literacy (O’Connor, 
Notari-Syverson, & Vadasy, 2005) to improve oral language, phonological awareness, 
letter knowledge, and integration of the two to develop the alphabetic principle. These 
activities have generated significant improvement in phoneme awareness for low-skilled 
students in field trials (Luchs, Luchs, Thompson, A1 Otaiba, Yen, Yang et al., 2001; 
O’Connor, Notari-Syverson, & Vadasy, 1996), and have shown evidence as an effective 
Tier 2 intervention in kindergarten (O’Connor et al., 2005; Vaughn et al., 2003). 

Alphabet letters and their most common sounds were introduced at a rate of one 
or two letters per week, depending on student progress. Phonemic awareness activities 
began with saying words slowly so that students could hear the individual speech sounds 
within one-syllable words. Most words were represented with pictures and objects to 
make them more concrete, teach the names for objects, and enable students who were 
ELL to participate more easily. Students generated a one-sentence message that was 



26 


Running head: Access to Rtl 

written by the Tier 2 teacher and used for finger-point reading and for identifying letters 
and sounds that had been taught. 

Letters and sounds were integrated with first sound activities so that students 
could begin to apprehend the alphabetic principle, and by December letters were used in 
onset-rime blending and segmenting and students manipulated letters on cards to 
represent where the letters would occur in a word. Students also began writing a letter to 
represent a sound routinely in lessons. 

By March students segmented words into three phonemes and represented all 
phonemes in words with letter tiles in an activity called Segment-to-Spell. We increased 
the difficulty of activities based on observations of students in groups and their scores on 
the progress monitoring measures. Students whose progress exceeded our cut-points for 
beginning of first grade were dropped, or exited, from Tier 2, but were monitored on the 
same schedule as other participants to insure that they maintained the levels they 
achieved during the treatment. 

First grade. Tier 2 in 1 st grade was based on Sound Partners (Vadasy et al., 2005) 
and included letter-sound, decoding, sight word identification, and reading of sentences 
and decodable books in small groups for 25-30 minutes four times weekly. These 
activities have generated significant improvement for low-skilled 1 st and 2 nd grade 
students when delivered as intensive, supplemental instruction (Vadasy, Jenkins, Antil, 
Wayne, & O’Connor, 1997; Vadasy, Sanders, & Tudor, 2007). 

Each lesson contained multiple activities. First, a new letter or letter combination 
was introduced about every two lessons and students reviewed previously taught letters 
and patterns. Phonemic awareness activities began with auditory blending and 
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segmenting in fall, but were phased out as students mastered these skills, and were 
replaced with spelling and writing activities, in which students focused on letters and 
patterns they had learned and decoding words. These reading and spelling activities 
provided practice opportunities to apply the new and reviewed letters and patterns. High 
frequency sight words were introduced through a cumulative introduction technique that 
included frequent review. Reading of a decodable sentence was introduced during 
Lesson 3, and by Lesson 9 at least two sentences were read daily. To provide more 
practice reading sentences and begin story reading, the Bob Books (Maslen & Maslen 
(2000) were used throughout Tier 2, along with the Decodable Books series from Open 
Court. These books begin with controlled vocabulary (e.g., “Mac had a bag and a dog.”) 
and become more various in vocabulary as students progress through the sets. Midway 
through first grade, about 10 minutes of the Tier 2 lessons focused on reading and 
rereading decodable books. Word endings and rules such as “magic e” were also 
included midway through the year. Most groups did not complete a lesson in a single 
session; on average students completed between 70-80 lessons in Sound Partners. 

Second grade. Tier 2 lessons in 2 nd grade included many of the 1 st grade 
activities (i.e., word study, reading and rereading books at their current reading level, and 
brief spelling and sentence-writing opportunities), but also brief lessons on word 
meanings and comprehension checks. The materials for these lessons included leveled 
reader series (e.g., I Can Read Books 1 ) and word patterns drawn from Teaching Word 
Recognition (O’Connor, 2007). Half of each lesson was devoted to reading and rereading 

1 The I Can Read Books include multiple titles at Levels 1 and 2, and are published by Harper 
Collins Publishers, New York. 
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text, one-third to word level reading and writing, and about 5 minutes to comprehension 
strategies (e.g., main idea, searching for details, and finding clues for making inferences), 
justifying responses with sentences in text, and sentence writing. These lessons were 
developed for this study and used across Tier 2 groups; however, the level of materials 
varied depending on students’ rate of progress. 

Fluidity across Tiers. In each grade, students receiving Tier 2 who scored above 
the risk range for two consecutive screening periods were released from intervention, but 
continued to be monitored throughout the year. Likewise, students who fell below the 
cut-points on fall or winter measures administered to all students (including students who 
were initially labeled typically developing readers) were folded into Tier 2 instruction. 
Thus our Tier 2 sample received intervention as needed (approximately the lowest 25% at 
each measurement cycle), but only about 12% of the sample participated in Tier 2 
continuously during the two or three years they had access to the Rtl model. Because our 
intent was to consider effects of access to Rtl, rather than a set time period of 
intervention, eligible students received varying amounts of Tier 2 instruction over one to 
three years, depending on whether their access to Rtl began in kindergarten or 1 st grade. 
We present statistics on the amount of intervention students received in the Results. 

Tutors and training. Tutors were hired and trained by project staff, and included 
experienced special education teachers, classroom teachers, graduate students, and 
teacher aides. Across the staff, 61% of these individuals were tutors for the entire three 
years of this research; 88% were with the project for two or more years. The lead tutor in 
each school participated in 30 hours of training on reading development in grades K-3 
and instructional activities focused on components of reading. All tutors received 
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training from the PI in instructional delivery of the specific Tier 2 package for each grade 
level (Ladders to Literacy for kindergarten, Sound Partners for first grade, decodable 
books with fluency practice and comprehension activities in second grade). Each year, 
the initial training lasted four hours, and included a theoretical introduction of each 
activity, modeling of the activity, guided practice, independent practice in small groups 
with observation, feedback, and discussion of common problems. In addition to the 
teacher manuals which were part of the published curricula for Ladders to Literacy and 
Sound Partners , tutors received a teacher manual generated by the researchers. This 
project specific manual included detailed descriptions of student activities, teacher 
scripts, a pacing guide for daily lessons, a pacing guide for monthly progress based on 
average growth, and flow charts linking specific types and levels of activities to progress 
monitoring scores. This initial training was supplemented by two hour bi-monthly follow 
up training (again taught by the PI) where new activities were introduced, common issues 
noted during field observations were discussed, and additional practice provided. Project 
staff also met bi-weekly in small groups by school site and reviewed progress monitoring 
data and monthly lesson plans for each individual student. These reports and plans were 
reviewed and approved bi-weekly by the PI, lead instructional staff, and the project 
director. 

Treatment Fidelity 

An experienced classroom or special education teacher was designated as the lead 
tutor at each site. In addition to observing, supporting, and providing feedback to the 
project staff, the lead tutor oversaw weekly the progress of students and modifications to 
the monthly lesson plans. The lead tutor collected daily activity logs completed by tutors 
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for each of the small groups, and these were reviewed by the authors. In reviewing the 
activity logs, we examined completion of the each of the steps/activities outlined in the 
teacher scripts, and progress through the lesson sequence. Progress below the pacing 
guidelines triggered a conference where activities and/or pacing were changed. 
Depending on the grade level, progress through the curricula was defined as shifts from 
phonological to phoneme-level activities (Kindergarten: Ladders to Literacy)', successful 
completion of Mastery Tests for blocks of lessons (First and second grade, Sound 
Partners ); and movement through a series of leveled readers of increasing difficulty with 
85% accuracy (second grade curricula). 

Daily activity logs were entered into the annual data base, and included 
information regarding the minutes of compiled treatment, the activities completed within 
the lessons, and the tutor’s assessment of the child’s performance. These activity logs 
were compared with observations of tutors conducted by the PI, instructional staff lead, 
and the project director. The fidelity observations were specific to each grade level 
curricula and included direct actions on the part of the tutors for each segment of the 
lesson. Fidelity was computed as a percentage of all observed actions compared to all 
actions expected. If any observation fell below 85% fidelity, the tutor was provided 
coaching and feedback, followed by co-teaching with the lead tutor until acceptable 
fidelity was reached. Fidelity observations were collected every 6 weeks on all tutors for 
each of the three years of the study. During this time period, we observed three instances 
(4% of all observations) of less than acceptable fidelity in Year 1, six instances in Year 2 
(3% of all observations) and no instances in Year 3. In each of these cases, fidelity 
immediately rose to acceptable levels after the corrective action noted was applied. The 
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average fidelity rating for Ladders to Literacy was 93.26%; for Sound Partners 92.06%; 
and for second grade curricula 89.4%. 

Results 

We tested for comparability across cohorts in three ways: differences between (1) 
kindergartners assigned to access in kindergarten or 1 st grade, (2) receptive vocabulary 
across the three cohorts, and (3) 2 nd grade reading ability for the Tier 1 (untreated and not 
at risk) students across cohorts. First, we tested for potential differences between children 
in kindergarten in schools assigned to have access to Tier 2 in kindergarten (3 schools) or 
first grade (2 schools). Multiple analysis of variance (MANOVA) on Fall early reading 
(ISF and RLN) and vocabulary measures in kindergarten was not significant (Wilks’ 
lambda F( 3 , 405 ) = 0.744, p=.476). Means and standard deviations are shown in Table 4. 

Next, we tested for pre-existing differences in vocabulary across the three cohorts 
(Grades K, 1, or 2 in Year 1). Analysis of variance (ANOVA) for receptive vocabulary 
(PPVT) in the first year of the study across conditions revealed no significant difference 
in language ability across cohorts (F ( 2 , ii 83 ) = 0.826, p = .43). Because we expected 
differences due to intervention, we also conducted ANOVA on 2 nd grade reading rate for 
students in Tier 1 across cohorts: the approximately 70% of each cohort considered 
typically developing readers. This test was not significant (F ( 2 , in) = 1.03). These 
typically developing readers did not participate in intervention during the study years, 
and together these results suggest no differences across cohorts due to general class 
instruction (Tier 1) or student vocabulary across cohorts. Means and standard deviations 


are shown in Table 4. 
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Analytical Decisions 

To determine whether a hierarchical model was needed, we first determined 
whether to nest the data within schools. A one-way random effects ANOVA was 
computed for the outcome variables as a preliminary step for a hierarchical linear 
modeling analysis. Data were analyzed at the student and school level to provide 
information regarding variability of outcome measures at both levels and the reliability of 
the school means surrounding the population mean. At the student level, the basic 
equation is: 

Reading Performanceij = Boj + fij 

where Boj is the average reading performance at School j, and ry, the person-level effect, 
which is the deviation from the mean for Student i in School j. Reading Performanceij is 
the dependent variable, oral reading fluency, for Student i in School j. At the school 
level, the basic equation is: 

Boj = Goo + Uoj 

where Goo is the grand mean for reading performance, Uoj is the deviation from the mean 
for School j, and Boj is allowed to vary randomly across schools. 

Estimates for the model are provided in Table 6. The grand-mean ORF 
achievement in the spring of Grade 2 was 57.75 with a standard error of 1.39 (95% Cl = 
55.03, 60.47). T-results were significant suggesting the average ORF achievement was 
statistically different from 0, t( 4) = 41.599, p = 0.000. At the student level, the variance 
component was 485.12. The variance component at the school level, 3.96, explained the 
deviation of each school’s average ORF achievement surrounding the grand mean. It is 
anticipated that 95% of the school means would fall between 53.85 and 61.65. The 
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significance test was not significant, % 2 (4) = 5.986, p = 0.199, indicating no significant 
variation in ORF performance across schools. 

The intraclass correlation coefficient, 0.0081, indicated that about 1% of the 
variance in spring ORF achievement was between schools. The reliability estimate of the 
sample means was 0.326 indicating moderate reliability. The above results suggest that 
an analysis across schools is unwarranted due to the limited variability at level-2 
(Raudenbush & Bryk, 2002). Because the classroom teacher and the Tier 2 tutor changed 
annually across the three years of the study, we could not consider further nesting of the 
data. 

We did not use a growth model across cohorts because we had multi-year data for 
students in the Rtl cohorts, but only Grade 2 data for the historical control. For the same 
reason, we did not covary for kindergarten scores, as these scores were unavailable for 
students who were 1 st or 2 nd graders in Year 1. 

Minutes of Tier 2 Intervention 

Table 7 shows the average number of minutes of Tier 2 intervention received by 
students who had access to Rtl in Kindergarten or 1 st grade. Unlike dosage studies, in 
which students are assigned to a treatment to receive a specified numbers of minutes, 
participation in Tier 2 in this study was dependent on student response to intervention. 
Thus students who were less responsive received more minutes of intervention than those 
who responded quickly to increased instructional intensity, and good responders returned 
to instruction in Tier 1 only. 

Because the students who were the least responsive received the greatest number 
of minutes in treatment, we did not expect ‘minutes of Tier 2’ to have a positive 
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relationship with reading outcomes, and that is what we found. The correlation between 
minutes of Tier 2 and end of Grade 2 total score on the WRMT was -.30 for students with 
K Access and -0.28 for students with Gr 1 Access. 

Effects of Kindergarten vs. First Grade Access to Tier 2 Intervention 

Students were classified as Kindergarten Access if they had been assigned to the 
Kindergarten (K) Access cohort and received Tier 2 intervention, regardless of when they 
received Tier 2 intervention. This means that the K Access Cohort (N=226) included 
students whose scores made them eligible for Tier 2 in kindergarten, and also students 
whose scores were above the cut-off in kindergarten, but who fell below first or second 
grade cut-offs and were included in Tier 2 when their scores met inclusion criteria. 
Likewise, students in the Grade 1 (Grl) cohort (N = 386) received Tier 2 as needed in 
first and/or second grade as their first and second grade scores made them eligible. 

In Grl we compared reading achievement between students at risk for reading 
difficulties (i.e., scoring below time-dependent cut-points) who had access to Tier 2 
intervention as needed beginning in K or Grl (results follow). In 2 nd grade, outcomes of 
students who had the opportunity to participate in Rtl from K or Grl were compared with 
the second graders in the same schools who were measured in Year 1, prior to 
implementation of the model (the historical control). 

Effects on reading in Grade 1. We conducted MANOVA on beginning and end 
of Grade 1 reading achievement for students at-risk, using access to Tier 2 intervention in 
K or Grl as the between variable. Due to collinearity among the WRMT subtests, we 
used the Total Reading score in these analyses; however, we report subtest scores also in 
Table 8. Fall reading achievement (NWF, WIF, and WRMT) favored students who had 
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K access to Rtl (Wilks’ lambda F( 3 , 238 ) = 3.721, p=.01) over students who began access to 
Rtl in 1 st grade. Effect sizes (using the pooled standard deviations) across the 3 measures 
were 0.72, 0.55, and 0.38, respectively and averaged as medium effects (Cohen, 1988). 

By the end of first grade, outcomes (WIF, ORF, and WRMT) continued to favor students 
in the K start cohort (F( 3 , 238 ) = 4.072, pc.00). Effect sizes across the 3 measures were 
0.30, 0.46, and 0.48, respectively, ranging from small to medium. Means and standard 
deviations for the two treated groups are shown in Table 8, along with scores for typical 
readers who consistently fell above cut points for inclusion in Tier 2 intervention. 

Effects in Grade 2. To test for 2 nd grade effects of K or Grl access to Rtl, 
MANOVAs followed by pairwise comparisons (using Bonferoni adjustments) were 
conducted between students at risk in each of the two treated samples and the historical 
Grade 2 control. Means and standard deviations for measures of ORF, WRMT, and 
GORT-4 are shown in Table 9. 

In Fall of Grade 2, MANOVA that included ORF and WRMT was significant 
(F(4, 602) = 18.266, p < .01). Pairwise comparisons revealed that both treatments 
outperformed the Control significantly for each measure, and the K cohort performed 
significantly better than the Grl cohort on the WRMT, with an ES of 0.38. By contrast, 
the ES between at-risk students with K Access and the Control group was large (ES = 
1.23). 

At the end of 2 nd grade, MANOVA that included ORF, the WRMT, and the 
GORT-4 showed that both treated groups outperformed the Control on all measures (F( 6, 
600) = 12.31, p < .01. Scores of students with K Access to Rtl did not differ 
significantly from Grl Access on reading achievement at the end of Grade 2, although 
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the effect size of the difference between K and Grl on the WRMT was over .3sd. Effect 
sizes between the experimental groups and the Control were consistently larger than 1. 

Disaggregating access by responsiveness. Another way to consider the 
difference in K vs. Gr 1 Access is by returning to Table 2, which shows the number of 
students who received Tier 2 by cohort and access year over time. When we examine the 
K Access children, 34 of the 75 students (8 + 11 + 15) met exit criteria by the close of 2 nd 
grade (45% of this sample). For students with Grade 1 Access, whether they were in 
kindergarten or first grade in Year 1, 36 (11 + 13 + 5 + 7) of the 139 students (40 + 7 + 

30 + 62) met exit criteria by the close of 2 nd grade (26% of this sample). Of these 36 
‘exiting’ children, two thirds of them (from the wait list kindergarten cohort) were 
considered not at risk in kindergarten based on our progress monitoring: they began to 
need assistance only when first grade reading demands increased. While we do not know 
the levels of kindergarten risk for the first grade cohort, we can reasonably assume that a 
proportion of these children were struggling in kindergarten for a full year prior to 
receiving first grade Tier 2 intervention. 

Disaggregation of the students who entered and exited in first grade lends support 
to this assumption. A total of 69 students (29 + 40) from the monitored Kindergarten 
cohort entered Tier 2 intervention as first graders. From these students receiving Tier 2 
first grade intervention, 22 (11+ 11) met exit criteria: 4 who were placed as 
kindergartners, and the remaining 18 after just one year of first grade intervention. In all, 
32% of these students exited from Tier 2. Compare this with the first grade cohort whose 
first access to Rtl was Gr 1; only 17% (5 of the 30 students) met criteria to exit Tier 2 


intervention. 
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Perhaps the most dramatic comparison between the K and Gr 1 cohorts is the 
number of students who were found at-risk for RD for the first time in 2 nd grade. 
Sixty-two children from the Gr 1 cohort who did not meet criteria in first grade 
were eligible for Tier 2 by the beginning of second grade; yet only 13 (6 + 7) from 
the K cohort qualified for the first time in 2 nd grade. Further, of the second graders 
receiving Tier 2 instruction, 30% of the K cohort (15 + 13 / 56 + 36) were able to 
meet exit criteria at the end of the year, while only 8% (7/87) of the Gr 1 cohort was 
able to do so. This analysis for students who were ELL is described below.Effects 
for Students Who Were English Learners 

At the end of 1 st grade, EL students who began access to Rtl in K or Grl did not 
differ statistically (Wilks’ lambda F(3, 103) = 1.61). By the beginning of 2 nd grade, EL 
with K Access performed significantly better on ORF and the WRMT than those with 
Grl Access (F(2, 104) = 3.76, p < .05) and by the end of 2 nd grade, this advantage 
remained for the WRMT (F(l, 105) = 4.03, p < .05), but not for ORF or GORT-4 scores. 
Rtl benefited students who were EL regardless of K or Grl access over EL students in the 
Control (F(6 , 248) = 5.58, p < .01, with partial eta squared of .12). Table 10 shows 
means and standard deviations for students by condition. 

For EL students, the patterns of entering Tier 2 (see Table 2) were similar to those 
of native English speakers, but the success rates were higher. Of the 37 (25 + 12) K 
Cohort ELL students who received Tier 2 intervention, 18 (7 + 6 + 5), or 49% exited by 
the close of 2 nd grade. For students with Grade 1 Access, whether they were in the K or 
Grade 1 cohort, 22 (4 + 8 + 4 + 6) of the 74 (23 + 5 + 16 + 30), 30 % of this sample, met 
exit criteria by the close of 2 nd grade. Examination of the students found to be at-risk for 
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RD for the first time in second grade reveals that approximately half of the K Cohort (5 / 

6 + 7) and half of the Grade 1 Cohort (30/60) were EL. 

Identification of Students for Special Education 

The final question is whether access to Tier 2 intervention in kindergarten or first 
grade changes the proportion, pattern, or characteristics of students who are identified for 
special education. In comparing the RTI cohorts to a historical control for special 
education proportions of incidence and ethnic representation, we limited our 
consideration to students with classifications of Specific Learning Disabilities and 
Speech/Language disabilities, as these conditions would be those most amenable to the 
influence of RTI. This decision eliminated about 20% of students receiving special 
education in each cohort with classifications such as Hearing Impaired, Moderate 
Cognitive Impairment, or Autism. 

Lor the historical control group of 381 second graders, 20 were placed in special 
education by the end of 2 nd grade, a proportion of 5.25%. Of these students, 12 were 
placed prior to second grade. Regardless of placement grade level, only half (10) of these 
special education students met the at-risk criteria based on ORL (39 or less wcpm) in 
second grade. Using this same criterion for at risk status, 102 of the historical control 
second graders were deemed at risk. Among these at-risk students, 9.2% of the Hispanic, 
7.69% of the African American, and 16.6% of the White students were placed. Among 
the ‘not at risk’ historical control students (n=279), 3.28% of Hispanic, none of the 
African American, and 9.09% of White students were placed in special education by the 


end of second grade. 
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For the sample of RTI Access students (n = 214), 11 were placed by the end of 2 nd 
grade, a proportion of 5.19%. Of these 11 students, 6 were placed in special education 
prior to second grade. Regardless of placement grade level, all but one of these 11 
students met the at-risk criterion based on ORF (less than 39 wcpm) in the beginning of 
second grade. Among the total sample of students found to be at risk during this study, 
5.98% of Hispanic, 5.00% of African American, 7.69% of White, and 14.29% of Other 
were placed in Special Education by the end of second grade. 

Regarding the impact of K versus Grl Access on special education placement, 4 
of the 68 students with K Access to RTI were placed in Special Education. These 
students (2 Hispanic, 1 African American and 1 White) all met the 2 nd grade criterion of 
at risk. With one exception, all were placed at the end of second grade. For the 144 
students with Grl Access, 7 were placed in special education. Among these seven 
students, all but one were placed prior to the end of Gr2. Reading outcomes for students 
placed in special education by cohort are shown in Table 11. 

Discussion 

As other studies have found (Connor et al., 2007; O’Connor et al., 2005; 

Simmons et al., 2008; Vaughn et al., 2009), participation in an Rtl model of early reading 
intervention improved reading outcomes over high-risk students in the same settings prior 
to implementing a tiered intervention model (our same-schools, Grade 2, historical 
control), whether access became available in K or Grl. These differences were robust. 

Our study was designed to determine whether beginning access to Tier 2 
interventions as needed in K would improve long-term outcomes over beginning in Gr 1. 
The answer seems to be that it depends on which outcomes are most valued. Beginning 
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in K gave students a boost that lasted through the beginning of Gr 2, but this advantage 
diminished between the treated samples over the 2 nd grade year. At the end of 1 st grade, 

K Access to Tier 2 improved reading outcomes over Gr 1 Access by over 1/3 of a 
standard deviation on the WRMT. By the end of 2 nd grade, the difference due to K 
Access was less than *4 of a standard deviation and no longer significant. 

Examining the proportions of students initially at risk who remained at risk 
through the end of Grade 2 tells another story. In the K Access cohort, 45% of students 
who received Tier 2 met exit criteria by the end of 2 nd grade, compared to 26% of 
students with Gr 1 Access. Of the students identified as at-risk at the beginning of Gr 2 
in the Historical Control, only 2% met the exit criteria of ORF > 90 wcpm at the end of 
the year. These statistics suggest that earlier intervention can shrink the proportion of 
students who remain in high-risk groups for reading difficulties. 

One possible conclusion from this analysis is that in the absence of intervention in 
pre-literacy skills in kindergarten, the students in our first grade cohort developed deficits 
that grew more troublesome as the reading demands increased across first and second 
grades—the so-called Matthew Effect (Stanovich, 1986). For students with K Access, 
Tier 2 intervention was (for some, not all) timely enough to push students past the wave 
of reading skill deficits. Such focused intervention may also enable struggling students to 
become attuned to reading instruction, making them more responsive to the instruction 
that is offered. 

Students Who Were English Language Learners 

Having access to early intervention in K was more important for students who 
were ELL than for native English speakers (NES), and we had hypothesized that K 
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Access could be particularly helpful for ELL students. Although the lack of English 
fluency for ELL students may increase the false positive identification of at-risk status 
(Klingner et al., 2006), we reasoned that the additional access to small group, intensive 
intervention in English could provide a lasting benefit to the K Access cohort of ELL in 
vocabulary as well as in reading, which could in turn improve reading comprehension. 
Our hypothesis was only partially borne out, at least with the power to demonstrate a 
significant difference. Lor students who were ELL, participation in Tier 2 regardless of 
start year improved reading outcomes significantly over ELL in the control group. 
Regarding when to begin access to an Rtl model, differences between students with K 
versus Grl Access on untimed measures of reading (i.e., most subtests of the WRMT) 
persisted through the end of Gr 2. Surprisingly, these differences were found on reading, 
and not specifically on the language measures (i.e., PPVT-III or WRMT vocabulary). 

Considering why we found no difference on the language measures, the PPVT 
and WRMT Word Comprehension measures were each designed for a broad age span, 
and may have lacked a sufficient number of items for students in Grades K-2 to detect 
differences in language development if they existed. Moreover, bifurcating the sample 
into ELL and non-ELL along with attrition took their toll by the end of 2 nd grade, and 
both of these factors decreased power to detect effects. Other explanations could also 
account for the lack of significant difference on vocabulary measures for students with K 
Access. Most important may be that access did not mean ongoing placement in Tier 2. 
Where in other studies of RtFs effects on vocabulary and reading development 
researchers have assigned students to interventions for specific amounts of time 
regardless of growth, our study was flexible and allowed nimble movement across tiers 
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based on students’ most current reading scores. In the ELL sample, as well as for the 
NES, students only participated in Tier 2 when their reading scores indicated risk. Some 
students in the K cohort did not demonstrate risk until 1 st grade, and so students in the K 
cohort did not necessarily have an extra year of Tier 2 intervention. 

Several studies in the last few years (Crosson & Lesaux, 2010; Mancilla-Martinez 
& Lesaux, 2010, 2011) have found that ELL students’ reading skills develop unevenly 
due to the differential relationship between word reading skills and reading 
comprehension over time. Although decoding and word recognition may show adequate 
development in the first several years of school, reading comprehension often lags behind 
significantly. Inspection of Table 10 shows a gap between word reading and passage 
comprehension of about 1/3 standard deviation (5-7 standard score points) on the WRMT 
in Gr 1 for the ELL students, regardless of risk. By the end of Grade 2, the gap has 
narrowed to 3 standard score points, with K Access students in the average range for 
word identification and reading comprehension. 

As Klingner et al. (2006) and others have suggested, poor English reading skills 
due to learning English is a different cause for poor reading achievement than poor 
reading skills in a child’s first language. As ELL students improve their English 
language understanding over time, some students identified for Tier 2 improved their 
reading skills rapidly and no longer needed assistance in succeeding years. In this study, 
28% of the ELL identified in K caught up sufficiently by the end of K to need no 
additional assistance with reading. By the end of Grl, an additional 25% caught up. 
Overall, our measures identified a proportion of ELL students for Tier 2 that was similar 
to the ELL population in the participating schools. 
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Late-emerging Reading Difficulties 

Unlike earlier studies in which the risk sample was identified in the first year of 
the study and remained constant, we identified new students for Tier 2 intervention across 
years as their scores on screening measures fell below the cut-points that indicated risk, 
as long as students were present in the schools in Year 1. This model emulates how 
schools tend to orchestrate Rtl models (e.g., see Mellard et al., 2009) and may have 
reduced the bias favoring Rtl results in studies that selected only kindergarten or 
beginning first graders as their intervention sample, thereby excluding students whose 
reading difficulties emerged later. 

This distinction is important because 36% of the students who received Tier 2 
intervention at some point across the three year study received it for the first time in 
Grade 2. These students’ scores were above the cut-points for Tier 2 intervention at all 
time points across Grades K-l. 

Unlike Kieffer’s (2010) study that reported high proportions of late-emerging RD 
for ELL, our proportions of late-emerging RD identified for the first time in Grade 2 were 
somewhat less for the ELL (29% of the total at-risk ELL sample) than for the NES 
students (44% of the total at risk NES), even though the progress of all students had been 
monitored since the beginning of the study. Considering all students who were identified 
for Tier 2 for the first time in 2 nd grade, the proportions of ELL and native English 
speakers did not differ. ELL students were included in Tier 2 based on their reading 
scores and not language ability, which may have decreased the proportion of late- 
emerging RD for the ELL students. Regardless of when they had access, some of our 
participants did not need Tier 2 instruction until the second or third year of the study. 
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Comparison of end-of-year scores across Tables 8, 9, and 10 also demonstrates that ELL 
students at-risk grew similarly to native English speakers in 1 st and 2 nd grades despite 
lacking proficiency in English. 

Identifying Students for Special Education 

In an experiment using Rtl as a model for decreasing special education 
placements, Wansek and Vaughn (2011) found that the implementation of tiered reading 
intervention school-wide produced measurable, though not significant, decreases in 
special education placement. The authors note that this drop may be a regression toward 
the national mean, because the study schools had been previously placing students in 
special education at a rate of 19.37%, well above the national average of 12%. As in 
VanDerHeyden et al.’s study (2007), eligibility categories were condensed across types 
of disability, which decreased differences that could be due to reliability of identification 
of students in high incidence categories, such as LD. 

In our control group the special education placement rate was 6.4%, a smaller 
figure to reduce, and in fact, overall access to Tier 2 instruction did not decrease referrals 
or placements. The presence of Tier 2 may have added to the earlier detection of students 
with high incidence disabilities, who may have received Tier 2 intervention for a period 
of time, did not respond, and were then referred for special education consideration. 
Similar to national statistics, LD placements were 50% of the classifications for the 
sample with earliest access to Tier 2 intervention. 

Although teacher referral may not be the most reliable way to identify LD, teacher 
ratings of reading ability and reading problems have been tightly linked in the research 
literature to reading status at year’s end in first grade (Speece, Schatschneider, Silverman, 
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Case, Cooper, & Jacobs, 2011) and in fourth grade (Speece et al., 2010); however, 
potential bias in teacher referral has also been linked to overrepresentation of minority 
students. Having access to Rtl did not decrease the proportion of students identified for 
special education, nor did it have a significant impact on the timing of identification. 
When students with high incidence disabilities are considered across cohorts, we found 
that those who had participated in Tier 2 interventions were nearly all identified for 
special education due to poor responsiveness to those interventions. All but one of these 
students with disabilities met poor-reader status at the end of Gr2. Among students with 
high incidence disabilities in the historical control group, only half met criteria for poor 
reader status. This difference suggests that participation in a model of Rtl may reduce 
teacher bias associated with the ‘judgmental’ categories of disability. 

Moreover, no students in the historical control cohort who were ELL were placed 
in special education by the end of Gr2. Although disinclination to refer or place ELL 
students could be viewed as sensitivity to language differences that can contribute to 
overrepresentation of minorities in special education (e.g., Cross & Donovan, 2002), 
ignoring the special needs of ELL students in the primary grades could also contribute to 
the overrepresentation of ELL students in special education by middle school (Kauffman, 
Conroy, Gardner, & Oswald, 2008) and to their late-emerging RD (Kieffer, 2010). Our 
data support the notion that using Rtl as one indicator of need for special services may 
reduce teacher bias and make access to special education more equitable across ethnic 
groups. 


Issues of Measurement in Rtl Models 
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Screening children for potential reading difficulties and monitoring their progress 
during interventions are primary components of every Rtl model. Although we have 
addressed the issue of access to Rtl in this study, notions of when to screen students for 
reading difficulties may be as important as when to treat them. Identifying students for 
Tier 2 in kindergarten provides more opportunity for early intervention before difficulties 
become severe; however, kindergarten screening tools are less reliable indicators of 
reading problems than measures of actual reading a year later. 

Quick screens such as segmenting spoken words into phonemes in kindergarten or 
reading rate in 2 nd grade can help to reduce teacher bias and identify children who may 
need additional, more effortful assessment to validate whether reading problems exist 
(Compton et al., 2010; Fuchs, Fuchs, & Compton, 2012). Nevertheless, some of the 
widely used additional and effortful assessments, including the WRMT or GORT, have 
their own much-discussed problems, especially for assessing reading comprehension, 
which is the key indicator of late-emerging LD (Catts et al., 2012). 

Determining reliable screening procedures for comprehension is important 
because measures of isolated skills (letter naming, segmenting, or word recognition) miss 
children who later develop reading difficulties due to language and comprehension 
problems that went untested in earlier grades. Finding appropriate measures for potential 
reading comprehension difficulties has proved problematic. For example, some 
researchers have suggested that passage comprehension as measured by the WRMT relies 
more heavily on word decoding than on comprehension of text (Francis, Fletcher, Catts, 

& Tomblin, 2005; Keenan, Betjemann, & Olson, 2008). Keenan and Betjemann (2006) 
found that older students with poor decoding skills score higher than expected on the 
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GORT, because it is possible to guess correct answers to some questions through 
background knowledge without reading the passages. 

In our work, poor readers in elementary school consistently score lower (i.e., 
standard scores) on the GORT than the WRMT, and so while we have not seen (and do 
not see in this study) problems of score inflation on the GORT, we suspect score inflation 
on the WRMT. Not only were our Tier 2 participants in the lowest quartile of their 
grade, but they were also the lowest quartile in low-achieving schools that had qualified 
for ‘school improvement’ under the No Child Left Behind Act. A glance at the receptive 
language (PPVT) scores in Table 3 shows that these students on average were about 10 
standard score points below that test’s national mean. Nevertheless, on the WRMT 
subtests they score well into the average range, even though their teachers still 
acknowledge them as poor readers in need of remediation. It is possible that the national 
focus on early reading and academic content in kindergarten has skewed the norms on 
this measure. For the research and practice communities, renorming existing measures 
and developing quick screens that are sensitive to comprehension in the early grades 
should be priorities for research on practices used in models of Rtl. 

Limitations 

As other studies have found, it is likely that we over-selected kindergartners as 
being at risk. Remember however that the kindergarten sample was disproportionately at 
risk; at one school, 85% of the kindergartners fell within the Tier 2 eligibility criteria 
based on one or more of the three early literacy measures (LNF, ISF, PSF). We elected 
to take the students most severely in need, with scores below our cut points on all 


measures, and continued to monitor the other students. These students were added to the 
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Tier 2 intervention if they did not show growth in general classroom instruction, which 
qualifies the notion of kindergarten access. Because we worked with the more severely 
at-risk students, the problem of over-identification was minimized; our treated 
kindergarten sample may be more comparable in certainty of identification to the first 
grade identified sample than has been true for other studies. Nevertheless, due to three 
years of monitoring student progress, we also “found” students in first and second grades 
whose early literacy measures in kindergarten had placed them above the high-risk cutoff 
for intervention. Across cohorts, the proportion of students selected for intervention 
each year remained near 25%. As some students met criteria to exit Tier 2, others became 
eligible, most likely due to the increasing complexity of “what counts” as reading in 1 st 
and 2 nd grades. 

Although attrition was large (about 40% by the end of Grade 2), it was in line 
with other longitudinal studies of Rtl. As examples, O’Connor (2000) reported 29% of 
participants lost over two years. McMaster et al.’s (2005) intervention lasted only seven 
weeks; nevertheless, they lost 7% of students. In Simmons et al.’s (2008) longitudinal 
study of Rtl, 51% of at-risk kindergartners remained in the study at the end of Grade 2. 

Although the teachers were mainly the same across all three cohorts, the effects 
reported here may be due, in part, to a systemic change in teachers’ awareness of reading 
difficulties in participating schools, which could also account for the differences between 
students’ reading ability in special education in the cohorts with and without access to 
Rtl. This possibility warrants further research attention. 

Overall, the limitations in this study reflect the hazards of research in public 
schools. Nevertheless, studies that mirror how schools use Rtl to identify students for 
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intervention and to consider response in eligibility decisions for special education are 
essential. Without such studies, it will be difficult for researchers to recommend a range 
of applications that could improve student reading outcomes and the reliability of special 
education placements. 

Implications 

In this study, whether having access to structured Tier 2 intervention as early as 
kindergarten mattered depended on when reading achievement was measured and who 
was measured. Differences were significant throughout first grade, but waned in second 
grade, except for students who were ELL, for whom earlier intervention was consistently 
better than later. 

Perhaps more importantly, a substantial proportion of students who eventually 
received Tier 2 scored too high in K in relation to their peers to be identified that early. 
Because we monitored the reading skills of all students in these schools three times per 
year, we can verify that over 20% of our Tier 2 sample in 2 nd grade were developing 
typically in the early literacy skills of phonemic segmentation and letter names and 
sounds in kindergarten, and only later (in Grade 1 or 2) demonstrated difficulties with 
words, reading rate, and/or comprehension. Whereas other research teams have 
identified late-emerging RD after third grade (e.g., Catts et al., 2012; Kieffer, 2010), our 
study suggests that measures commonly used to identify students for early reading 
intervention (i.e., measures of letter knowledge and segmenting skills) miss students who 
develop reading difficulties as early as mid-first grade. 

Our study also included students whose reading achievement through the end of 
first grade was average on standardized measures (e.g., WRMT) and reading rate, who 
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nevertheless encountered difficulties with multisyllabic words and comprehension in 
second grade that indicated a need for intensive intervention. These issues raise the 
possibility that studies in which students were identified as high-risk only in kindergarten 
or first grade—and who then become the sample that was followed longitudinally— 
might amplify the likely results of implementing an Rtl model on decreasing the 
proportion of students at risk. 

Whether Rtl should (or be expected to) reduce the proportion of students 
identified for special education has also received considerable discussion (Fuchs, 2003; 
Wanzek & Vaughn, 2011). In our study early intervention did not reduce the proportion 
of students placed in special education, but may have reduced bias associated with 
placing students with LD in special education. More and longer studies that include all 
students in schools (rather than identifying only those who appear at risk in the first year 
or two of schooling) are needed to demonstrate whether the patterns found here will be 


consistent. 
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Table 1 

Student Demographics by School 


Percentage 



Free & 

Parents not 

Students Scoring Proficient 

English 


Reduced 

High School 

or above on CST ELA 

Language 


Lunch 2 

Graduates 




Learners 

Grade 



2 nd 

3rd 

4 th 


School 1 

56 

41 

45 

22 

44 

50.92 

School 2 

69 

39 

23 

17 

34 

60.17 

School 3 

89 

36 

38 

25 

30 

38.53 

School 4 

86 

27 

31 

21 

38 

38.24 

School 5 

95 

87 

31 

22 

33 

44.83 


Note: CST ELA = California State Test, English Language Arts. 


2 2008 Growth Academic Performance Index (API) Report/Demographics 
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Table 2 


The Number of Students Receiving Tier 2 Intervention by Cohort, Year of Access, ELL 
Status (in brackets) and Grade of Treatment 



Kindergarten Cohort 

1st Grade Cohort 


Access in: 

K 

Grade 1 

Grade 1 

Totals 

New Kindergarten Start 

40 [25] 

XXX 

XXX 

40 [25] 

Kindergarten Exit 

8 [7] 

XXX 

XXX 

8 [7] 

New Lirst Grade Start 

29 [12] 

40 [23] 

30 [16] 

99 [51] 

Continued from 





Kindergarten 

32 [18] 

XXX 

XXX 

32 [18] 

Lirst Grade Total 

61 [30] 

40 [23] 

30 [16] 

128 [69] 

Lirst Grade Exit 

11 [6] 

11 [4] 

5 [4] 

27 [14] 

New Second Grade Start 

6 [0] 

7 [5] 

62 [30] 

75 [35] 

Continued from Lirst Grade 

50 [24] 

29 [19] 

25 [11] 

101 [54] 

Second Grade Total 

56 [24] 

36 [24] 

87 [41] 

176 [89] 

Second Grade Exit 

15 [5] 

13 [8] 

7 [6] 

35 [19] 

Total Exiters 

34 [18] 

24 [12] 

12 [10] 

70 [40] 

Total Continuing in RTI 

41 [19] 

23 [16] 

80 [36] 

144 [71] 


Note: Numbers in brackets are students who are English Language Learners 
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Table 3 


Criteria for Student Entry into Tier 2 and Exit Criteria Across Grade Levels 


Criteria to: 

Kindergarten 

Grade 1 

Grade 2 

Enter 

Exit 

Enter 

Exit 

Enter 

Exit 

Enter 

Exit 

Enter Exit 

Measure 

Fall 

Winter 


Fall 


Winter 

Fall 


Winter 

Initial Sounds 

<6 










Letter Naming 

<8 

< 15 

>40 

<45 


<45 





Segmenting 


<7 

>30 

<25 


<30 





Nonsense Words 




<25 


<50 





Word Identification 




<8 







Oral Reading 3 





>20 

<7 

>39 

<39 

>68 

< 60 >90 


3 Oral reading fluency scores are correct words per minute 
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Table 4 

Comparability: Kindergarten Pretests (all schools) and Peabody Picture Vocabulary 
Test (PPVT) 

Kindergartners with Access to Tier 2 in: 



Kindergarten 

Grade 1 

Year 1: 

(n = 226) 

(n= 183) 

Kindergarten Fall ISF 

9.89 (8.84) 

9.87 (7.29) 

Kindergarten Fall RLN 

12.32 (14.62) 

13.48 (13.65) 

PPVT SS 

86.50(12.53) 

87.71 (13.86) 


Notes: ISF = Initial Sounds Fluency, RLN = Rapid Letter Naming, SS = Standardized 


Score 
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Table 5 


Comparability: Tier 1 Only (Average Readers) Across Cohorts at the End of Grade 2 


Cohort 

ORF 

WRMT SS 

PPVT SS 

Kindergarten 

99.99 (26.12) 

103.00 (9.85) 

87.49(12.75) 

Grade 1 

95.55 (22.60) 

105.14(10.73) 

87.45 (11.74) 

Grade 2 

95.69 (26.84) 

104.28 (9.60) 

87.04(12.68) 


Notes: Tier 1 (Average) readers did not participate in Tier 2 intervention in any cohort 
and represent approximately the top 75% of the sample. ORF = Oral Reading Fluency, 
WRMT SS = Woodcock Reading Mastery Tests Standardized Score, PPVT SS = 
Peabody Picture Vocabulary Test Standardized Score. 


Table 6 

Results from the One-Way ANOVA Model, Outcome Variable is Grade 2 Spring ORF 


Fixed Effect 


Coefficient 

se 

Average school mean, B0 


57.75 

1.39 

Random Effect 

Variance Component 

Df 

X2 

School mean, uoj 

3.96 

4 

5.986 

Level-1 effect, n. 

485.12 
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Table 7 

Descriptive Statistics for Minutes of Tier 2 Intervention by Cohort and Access 


Kindergarten Cohort First Grade Cohort 



K Access 

Gr 1 Access 

Gr 1 Access 

Minutes in Kindergarten 

651(234) 

NA 

NA 

Minutes in Grade 1 

1073 (398) 

902 (385) 

1076 (284) 

Minutes in Grade 2 

981 (478) 

1196 (643) 

1082 (464) 


Note: Numbers reflect the average minutes in Tier 2 intervention per student, with 


standards deviations in parentheses. 
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Table 8 


Descriptive Statistics for Students Across Conditions in Grade 1 




Students at Risk 

Not at Risk 



K Access 

Grade 1 Access 


Fall 

Nonword Fluency 

23.30(13.59) 

19.35 (11.45) 

42.39 (18.72) 


Word Identification Fluency 

7.01 (5.40) 

5.37 (4.47) 

23.16(17.14) 


WRMT Word Identification 

106.07 (12.00) 

99.91 (13.64) 

119.39 (9.51) 


Word Attack 

106.59 (10.30) 

99.23 (11.65) 

115.52 (7.98) 


Passage Comprehension 

96.09 (14.56) 

92.06(10.81) 

109.62 (9.38) 


Total Reading 

101.88 (11.57) 

97.50(11.54) 

115.18 (9.14) 

Winter 

Nonword Fluency 

42.69(16.73) 

35.51 (14.74) 

61.09 (29.76) 


Word Identification Fluency 

22.68 (13.46) 

19.22(12.06) 

49.75 (21.85) 


ORF 

21.66(12.92) 

13.35(12.01) 

51.49 (25.13) 

Spring 

Word Identification Fluency 

34.09 (17.43) 

26.97 (16.53) 

60.07 (21.18) 


ORF 

32.41 (17.17) 

27.18(16.18) 

65.21 (25.92) 


WRMT Word Identification 

106.23 (10.80) 

100.75 (13.57) 

103.07 (9.65) 


Word Attack 

107.41 (9.47) 

102.99 (14.02) 

105.27 (8.56) 


Passage Comprehension 

99.68 (9.39) 

94.72(13.51) 

101.13 (9.23) 


Total Reading 

103.07 (10.38) 

97.92(11.75) 

112.23 (9.34) 


Notes: Fluency scores are items correct in one minute, ORF = Oral Reading Fluency, 


WRMT = Woodcock Reading Mastery Tests, with Standardized Scores shown. 
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Table 9 

Descriptive Statistics for Students Across Conditions in Grade 2 

K Access 
N 75 

Fall Oral Reading Fluency 33.68 (16.95) 

WRMT Word Identification 105.49 (11.25) 

Word Attack 105.22 (9.96) 

Passage Comprehension 101.32 (8.27) 

Total Reading 103.26(11.01) 

Spring Oral Reading Fluency 62.98 (23.95) 

WRMT Word Identification 101.75 (7.42) 

Word Attack 103.89 (10.44) 

Passage Comprehension 99.90 (9.62) 


Total Reading 


100.93 (7.83) 


68 


Students at Risk 
Gr 1 Access 
139 

30.40(13.76) 

102.77 (10.58) 
102.41 (10.36) 

97.23 (9.15) 
98.98 (11.45) 
60.58 (22.93) 
98.80 (8.69) 

100.78 (13.49) 
95.16 (8.97) 
97.46 (10.60) 


Historical Control 
102 

21.83 (9.68) 
93.63 (10.75) 

94.49 (9.32) 

90.49 (10.48) 
90.27(10.14) 
48.37 (19.29) 
93.80 (8.54) 
95.68 (8.34) 
92.23 (8.51) 
92.09 (8.82) 


Not at Risk 

545 

67.86 (24.89) 
109.18 (11.19) 
110.02(12.79) 
104.93 (9.72) 
106.30 (10.03) 
97.62 (25.40) 
[not administered] 
[not administered] 
[not administered] 
[not administered] 
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GORT-4 


88.44(12.04) 84.96(12.29) 71.78 (9.68) [not administered] 


Notes: WRMT = Woodcock Reading Mastery Tests, with Standardized Scores shown; GORT-4 = Gray Oral Reading Test, with 
Standardized Scores shown 
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Table 10 

Means and Standard Deviations for First and Second Grade Measures for Students Who Were English Language Learners (ELL) 




ELL Students at Risk 


EL Not At risk 

Spring, Grade 1 

K Access 

Gr 1 Access 

Control 


N 

34 

73 

35 

230 

Oral Reading Fluency 

28.61 (15.22) 

23.63 (14.12) 

[not administered] 

64.63 (25.17) 

WRMT Word Identification 

105.90(12.64) 

103.15 (10.74) 

[not administered] 

104.60(10.80) 

Word Attack 

106.74(10.74) 

103.15 (13.22) 

[not administered] 

104.30(10.68) 

Passage Comprehension 

98.16(9.78) 

97.93 (8.78) 

[not administered] 

99.70 (9.01) 

Total Reading 

103.00(11.11) 

101.00 (9.72) 

[not administered] 

114.76 (8.42) 

Fall, Grade 2 

Oral Reading Fluency 

36.71 (19.16) 

30.14(12.87) 

23.49 (11.79) 

69.33 (24.10) 

WRMT Total Reading 

104.58 (9.17) 

98.97 (11.34) 

91.77 (8.29) 

106.97 (9.53) 

Spring, Grade 2 

Oral Reading Fluency 

64.76 (22.83) 

60.47 (21.30) 

49.59 (19.77) 

100.69 (23.86) 
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WRMT Word Identification 

102.39 (6.57) 

99.61 (7.81) 

94.83 (8.22) 

109.52(11.96) 

Word Attack 

103.04(10.30) 

99.72(16.01) 

96.28 (8.99) 

110.70 11.77) 

Passage Comprehension 

99.24 (7.47) 

96.81 (7.96) 

92.76 (8.56) 

104.87 (8.58) 

Total Reading 

101.35 (7.58) 

98.05 (8.29) 

93.67 (7.15) 

109.11 (9.16) 

GORT-4 

88.54(12.03) 

86.41 (12.34) 

68.64 (9.41) 

[not administered} 


Notes: WRMT = Woodcock Reading Mastery Tests, with Standardized Scores shown; GORT-4 = Gray Oral Reading Test, with 
Standardized Scores shown 
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Table 11 


Reading Outcomes for Students Placed in Special Education * by Cohort and English 
Language Learner (ELL) Classification 



At-Risk 

Control 


K Access 

Grl Access 

At-Risk (no access) 

No Reading risk 

Total N* 

4 

7 

10 

10 

ELL N* 

2 

4 

1 

2 

ORE 

Total 

19.75 (11.35) 

46.83** (27.18) 

34.10(18.43) 

105.5 (18.48) 

ELL 

20.50(17.68) 

34.33 (34.56) 

42 

100 (29.70) 

WRMT Total 

Total 

89.00(11.69) 

92.70 (6.82) 

84.90 (8.82) 

104.5 (12.17) 

ELL 

87.00 (19.80) 

92.23 (10.26) 

87 

107 (11.31) 


*Students with moderate to severe disabilities, including Hearing Impairment, 
Moderate Cognitive Impairment, and Austim were excluded from these numbers and 


scores. 



